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Abstract 

The measurement of the underlying event (UE) and its separation from hard interac- 
tions in hadron-cohider events is a conceptuaUy and practicaUy challenging task. We de- 
velop a simple, mostly analytical toy model for the UE in order to understand how different 
UE measurement approaches fare on the practical aspects of this problem, comparing the 
traditional approach used so far at Tevatron with a recently proposed "jet-area/median" 
approach. Both are found to perform comparably well in measuring average properties of 
the UE, such as the mean transverse momentum flow, but the jet-area/median approach 
has distinct advantages in determining its fluctuations. We then use the jet-area/median 
method to investigate a range of UE properties in existing Monte Carlo event-generator 
tunes, validating the main results of the toy- model and highlighting so- far unmeasured 
characteristics of the UE such as its rapidity dependence, as well as its intra- and inter- 
event fluctuations and correlations. 
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1 Introduction 

The "underlying event" (UE) in high-energy hadron-hadron collisions can be thought of as 
the low transverse momentum (pt) part of the event activity that is not naturally associated 
with the hard interaction. 

Despite being a low-pt phenomenon, the underlying event has a large impact on high-pt 
physics at hadron colliders. For example in measurements of the inclusive jet spectrum at 
Pt ~ 50 GeV it can affect the result by up to 50% [1] . It also biases kinematic reconstructions 
(for example in top mass measurements), degrades their resolution and affects the efficiency 
of isolation criteria that enter into the experimental identification of particles like photons 
and electrons. It has even been suggested that certain new physics scenarios might show up 
in "anomalous" characteristics of the underlying event [2j . For these reasons it is important 
to have a good understanding of its properties. 
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Figure 1: Views of a hard scattering plus UE; (a) shows multiple 2—7-2 interactions, as 
incorporated in the most successful models; (b) illustrates how a collinear splitting in the 
initial-state can lead to correlation between the partons involved in a double 2 — )• 2 scattering; 
from (c) one sees the similarity with a perturbative 1-loop 2 — )■ 4 diagram; and (d) represents 
a BFKL-inspired picture for the UE. 

The purpose of this article is to investigate some different ways in which the underlying 
event can be measured and/or constrained experimentally. Such measurements enter into 
tunes of Monte Carlo models of the UE [3Vtl0j . They can also serve as an input to analytical 
methods of accounting for the average UE correction to a jet's transverse momentum |lHll2j 
and to approaches that correct for the UE on a jet-by-jet and event-by-event basis [13] . as 
well as for related work that seeks to optimise jet definitions. 

A difficulty in discussing the measurement of the UE is that there exists no good definition 
of what the UE actually is, or how to distinguish it, in a conceptually unambiguous way, from 
the hard interaction. For instance, the most successful phenomenological models of the UE 
involve multiple 2 — )• 2 scattering as in fig. [1^ ( "multiple parton interactions" — MPI) . Very 
simply, they supplement the one hard interaction in the event with multiple other lower-pt 
interactions, whose multiplicity is determined by the 2—7-2 cross section, regulated with a 
\ow-pt cutoff of the order of a couple of GeV. Fig. [T^ can only be part of the picture because 
some of the partons entering multiple 2 — )• 2 scatterings are necessarily correlated, e.g. due 
to energy conservation (cf. ref. |14j). or because they can have a common origin from an 
initial-state collinear splitting, as in fig. [TJa. However the contribution of fig. [TJa is itself also 
part of the 2 — )• 4 1-loop scattering diagram fig. [TJ; (as discussed for example in [E]), which 
is relevant as a N^LO correction to the dijet cross section or at N^LO in its interference with 
tree-level 2 — )• 4 scattering. This means there are non-trivial questions of double-counting 
between multiple parton interaction and perturbative higher orders. In addition, the radiation 
that fills the event is not bound to come just from 2—7-2 scatterings, but may also arise from 
BFKL type configurations which can involve (multiple) chains with \aw-pt emissions spread 
in rapidity, as in fig. [T]l (some work towards modelling this while retaining consistency with 
the total cross section is to be found in [16j). Though these ways of viewing the UE represent 
just a subset of the diversity of physics considerations that are of potential relevance for its 
modelling (more detailed reviews are to be found in Refs. fl0','17]), they do illustrate the 
difficulties that arise in ascribing unambiguous physical meaning to it. 

Given this complexity in discussing what the UE might be, how are we to go about 
measuring it? A feature present in most models is that, on some low pt scale, UE activity fills 
the whole event. One way then of characterising the UE is to say that it is whatever physical 
effect fills most of the event with radiation. To help understand what implications this picture 
has for UE measurements, we shall develop (section [3|) a semi- analytical two-component toy 



2 



model: one component will be pm'ely soft and dispersed across the event, corresponding to the 
UE, while the second component will involve the hard scattering and a simple approximation 
for the perturbative radiation with which it is associated. 

Though our toy model is undoubtedly too simple to fully reflect reality, the fact that we 
know exactly what goes into it will make it quite powerful: it will, for example, allow us 
to take different UE-measurement approaches and examine to what extent their results are 
affected both by the radiation associated with the hard scattering, and by the techniques used 
to limit that hard contamination. This can be investigated both for averaged quantities, and 
for event-by-event extractions of information about the UE. A number of the results will be 
given in analytical form, in terms of the characteristics of the UE and the hard scattering and 
of the parameters of the measurement methods. This will give insight into the compromises 
that arise when measuring the UE, especially when extending existing methods to the greater 
phase space that is available at LHC relative to Tevatron. 

The two UE measurement methods that we shall investigate are both reviewed in section [2j 
One, which we call the "traditional" approach (see for example |18|ll9j ). is currently the default 
approach for most UE studies. Another, the jet-area/median based approach of |12pi3j. was 
originally developed for evaluating pp pileup (and backgrounds in heavy-ion collisions, as used 
for example in [20]), but may also have benefits for UE studies. The basic results for the two 
approaches will be derived in section [3l Some readers may prefer to skip most of this section 
and read just the final summary of these results, as presented in section \3M 

For the purpose of determining the quantity that we call p, the UE's mean transverse 
momentum {pt) flow per unit rapidity-azimuth area, both methods will turn out to have 
systematics that are under control and of similar magnitude, at about the 20% level (except 
at high pt for the traditional method). However it is also important to have knowledge 
of fluctuations of the underlying event (both intra and inter-event). Since it is the jet- 
area/median method that will prove to be the more robust tool for measuring them, it is 
this method that we will use when, in section [H we look at a range of possibly interesting 
measurements of the UE. They will be carried out on Monte Carlo events simulated with 
Pythia 6.4 [21j and with Herwig 6.5 [22] with Jimmy 4.3 [3]. This part of the study will help 
validate the understanding developed with the toy-model, and illustrate determinations of 
the UE average pt flow as a function of rapidity, its event-to-event fluctuations, its intra-event 
fluctuations, and the degree of intra-event correlation. These observables go beyond the kinds 
of measurements that are commonly discussed for Tevatron or envisaged so far for the LHC 
and, as we shall see, there will be substantial differences between Jimmy and Pythia UE tunes 
on a number of them. 

2 Overview of measurement approaches 
2.1 Traditional approach 

The main current approach [18j to measuring the properties of the underlying event involves 
considering the central part of the detector, say \7]\ < 1, where the pseudorapidity rj is defined 
as = — In tan |. One tags events based on the presence of a jet (whose direction defines an 
azimuthal angle i;^ = 0), and divides the central part of the event into four blocks in azimuth: 
the "towards" region, typically < it/3, an away region 27r/3 < < vr, and two transverse 
regions, covering it/3 < < 2tt/3. This is illustrated in fig. [2] (left). Since the trigger and 
recoil jets will usually occupy the towards and away regions, one then restricts one's attention 
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Figure 2: Illustration of underlying-event measurement methods. Left: representation of 
TransMin and TransMax regions of the rapidity azimuth plane in the traditional approach. 
Right: representative distribution of Ptj/Aj for the set of (real and ghost) jets in a single 
event, as used to determine p and a in the area/median approach. 

to the two transverse regions. There one measures the multiplicity of charged tracks above 
some transverse-momentum threshold as well as the total transverse momentum contained in 
the charged tracks (sometimes normalised per unit area, dr]d(p). The results for the charged 
track multiplicity and charged pt flow are usually presented as averages across many events, as 
a function of the pt of the leading jet. One also sees measurements of the charged momentum 
flow as a function of the multiplicity. 

Since there is a probability of order as that at least one of the transverse regions is 
contaminated by perturbative radiation from the dijet event, which substantially affect the 
extracted information about the UE's pt flow. To work around this, it is usual to label the 
two transverse regions as TransMin and TransMax, respectively the less and more active of 
the two. The largest component of perturbative contamination should be restricted to the 
TransMax region, while TransMin should be less affected. 

In the earliest variants of this "two-region" method |23p24j. the two regions used for 
sampling the UE were actually placed at non-central rapidities, rather than central rapidities 
and transverse azimuth, and it was the total transverse energy flow that was considered 
rather than just its charged component. Another variant measured charged momentum flow 
in cones [25]. These differences reflect the freedom inherent to this method: the question of 
where to place the "transverse" regions, and the choice of their shape and size. In the rest 
of this article we will always assume that the transverse regions are well separated from the 
dijet system in an event, as shown in fig. [5] (left), and we will quote our results as a function 
of the area Axrans of each of the transverse regions. 

2.2 Jet-area/median approach 

In the jet-area/median approach, one first clusters the event with a Cambridge/Aachen (C/A) 
[SSIET] or inclusive kt [MIEH] type jet algorithm. To each jet, j, one attributes an "active" 
jet-area, Aj, as described in more detail in [T^]. This is calculated by adding a large number 
of "ghost" particles to the event (each with negligible pt ~ 10^^*^*^ GeV) and including them 
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in the clustering. The area of a jet is then proportional to number of ghosts it contains. Some 
jets contain just ghost particles ("pure ghost jets") and are considered to have pt = 0. 

A proposal [13] for a way to measure the level p of uniform background noise in an event 
is then to take it to be the median of the distribution of the ptj/Aj for the ensemble of jets 
in that event 



median 

jejets 



(1) 



as shown schematically in fig. [2] (right). The logic of the use of the median is that it is much 
less susceptible to contamination from outliers (i.e. hard perturbative jets) than the mean. 
In addition to measuring p one can also determine the intra-event fluctuations of the UE. We 
introduce a quantity a, defined such that a fraction X/2 of jets satisfy 



'^/^/W) <Pt3/^j < (2) 



where X = Erf(l/\/2) — 0.6827 is the fraction of a Gaussian distribution within one standard 
deviation of the mean. The approach to measuring a is analogous in spirit to the use of 
the median for determining p. As shown in fig. [2] (right), it is one sided (i.e. just considering 
jets with ptj/Aj < p). This choice has been made so as to limit contamination of a from hard 
jets when the total number of jets is small. 

This method was originally suggested in [13] as a way of measuring average pileup noise 
across an entire event (e.g. for \y\ < 5). One difficulty when using it for the UE will come 
from the fact that the UE is significantly softer, and has relative fluctuations that are larger 
and less Gaussian than those in (say) 20 pileup minimum-bias events, both consequences of 
the lower density of particles. 

Another point is that in measuring the UE it is important to obtain differential information 
on the UE's rapidity (y = In ^^^) dependence. This leads us to consider 



p{y) = median 

jr'ejets, \yj-y\<5y 



Ptl 



(3) 



where 25y is the width of a rapidity window in which one carries out the measurement. The 
fact that one has a relatively limited number of jets in a given rapidity window will be one 
further challenge that we will face in using this method for studying the UE, because the 
relative impact of the presence of a hard jet in the region of interest is amplified by the small 
total number of jets. A modification of the method that helps address these difficulties is 
"hard jet removal," first employed in the STAR collaboration's [30] use of the techniques 
of [T3] for estimating the (very large) UE for jet measurements in heavy-ion collisions. We 
will investigate the impact of this modification here for events with dijet topologies, in which 
case we will simply remove the two hardest jets in the event from the overall list of jets. 

A choice that is present in both pileup and UE measurements with the area/median 
method is that of the jet algorithm and jet radius R. Both the kt [281 [29] ^'^^ Cam- 
bridge/Aachen [261127] algorithms are suitable options, because they produce jets whose area 
distribution is quite regular. In contrast, algorithms that give mostly conical jets (like anti- 
kt [31] and, to a lesser extent, SISCone [32J) tend not to be, because they fill in the "holes" 



^This definition is such that, if the noise distribution is Gaussian and uncorrelated from point to point 
within the event, then a patch of area A will have an average noise pt contamination of pA and standard 
deviation oyfA. 
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between the cones with jets with very small areas, which can have unrepresentative pt/A 
values. The question of what R value to use is one of the freedoms of the method and will be 
discussed in the coming section. 



3 A toy model 

To understand the strengths and weaknesses of different UE-measurement approaches, it is 
helpful to consider events as consisting of two components: a low-p^ ("soft") noise component, 
defined to be the UE, supplemented with hard jets from a perturbative scattering and asso- 
ciated higher-order corrections. We will introduce models for each of these two components 
and investigate how the methods behave when either of the two components is present alone 
and when both are present together. Our guiding principle in designing these models has 
been to keep them sufficiently simple as to be treatable analytically, while also maintaining a 
reasonable degree of realism. While our combined hard and soft models will not quite have 
the continuous transition between hard and soft components that is present in Monte Carlo 
MPI models, we will see that they nevertheless lead to certain signature behaviours in UE 
measurement methods that correspond nicely to what is observed in Monte Carlo simulations. 



3.1 Low-pt component 

As a simple model for the underlying event let us imagine that on average in a patch of unit 
of area there are v particles, that the probability distribution for the number of particles n 
in a specific patch of area A follows a Poisson distribution, P„ = {yA)'^e~'^^/n\, and that the 
single-particle transverse-momentum probability distribution is given by 

1 dPi 1 



Pi dpt /X 



e-P*/^ , (4) 



where /i is the mean transverse momentum per particle. This particular form has been chosen 
mainly because it will allow us to carry out analytical calculations^ If a patch contains n 
particles then the probability distribution for its pt is given by 



^ dPn _ 1 pI 



n-1 

e-P'l^ . (5) 



Pn dpt (n-1)! ^ 

For a patch of area A, summing over the Poisson distribution for the number of particles in the 
patch ((n) = uA) gives us the overall probability distribution for the transverse momentum 
in the patch as 

= S{p,]e-''^ + e-''^-"'>-,!—h I 2,/^ I , (6b) 




^Another simple variant will be considered in appendix |^ It would also be interesting to consider a 
distribution suppressed as 1/p" at large pt, more in line with the scaling that is to be found with multiple 
parton interactions. However, we have not found a form with this property that can be handled analytically 
throughout the calculation. Given that the results from the toy model will reproduce many features that we 
see in Monte Carlo simulation, we believe that the form we are using is adequate for our purposes. 
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where Ii is the (first order) modified Bessel function of the first kind. The mean and standard 
deviation of the distribution are given by uAfi and \/2uAfi. It is convenient to express this 
as saying that the transverse momentum in a patch of area A is 

Pt{A) = pA±aVA, (7) 

where, in the model discussed here, p and a are given by 

p = up , (8a) 

a = V2^p = p^. (8b) 

In the limit in which vA ^ 1, the distribution in eq. (jG]) tends to a Gaussian with mean and 
standard deviation as given in eqs. ([7]) and ([8]). This is a consequence of the central- limit 
theorem. 



Traditional approach. Taking the area of each of the transverse regions to be ^xransj the 
traditional approach will extract the following results for p in the transverse Average, Min 
and Max regions 

(Pext,Av) = P , (9a) 

(^'ext.Min) = P , f , (9b) 

V TT^Trans 

(/Jext.Max) = P+ , ^ , (9c) 

V^^Trans 

where the Min and Max results have been derived using the Gaussian limit of eq. Q. The 
only one of the above results that correctly estimates p is the average. The Min and Max 
results tend slowly to the correct answer in the limit in which ^xrans is large. 

In the literature (e.g. [18j). ^Trans has usually been taken equal to 27r/3. As we shall see 
in section HI a typical value for a at the LHC {^/s = lOTeV) is a ~ 0.5p — 0.75p, which 
implies that the o^/^/^^A^^^ term is about 20 — 30% of p. 



Area/median-based approach. To help understand the behaviour of the area/median- 
based approach, let us replace the jets (which have a range of areas) with uniform rectan- 
gular tiles, each of which has a fixed area ^tiie- It is important to use the full distribution 
dP / dpt{Atiie) as given by eq. ([6]) rather than the Gaussian distribution, because a physically 
interesting domain is that in which z^^tiie is of order 1. The extracted value pext of the UE 
Pt density in the tiled approximation is given by the median value of Pf,tiic/^tiic across the 
many tiles in a single event. It can be determined from the solution of the equation 

/ dp (Atile) = -, (10) 

Jo dpt 2 

This result has been obtained in the limit of there being a large number of tiles, i.e. large ^tot; 
which allows us to approximate the distribution of tile transverse momenta in a specific event 
with the average probability distribution ^(Atiie) (see also appendix IB.ip . The integral in 
eq. (jlOp is non-trivial to evaluate analytically, however an approximation to the solution for 
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Pext that is accurate to a couple of percent and has the correct asymptotic behaviours is given 

by 

Pe.t^P ■ ^ , 1 9Mtiic-ln2). (11) 

i/Atiie - In 2 + i 

The result is non-zero only for i^Atiic > hi 2, which stems from the requirement that tiles with 
no particles, i.e. the S{pt) contribution in eq. should not account for more than half of 
the total number of tiles. This property of a sudden turn-on, as well as the fact that at large 
i^Atiie the offset from the correct p goes as l/(z^^tiie)5 

^^ = -^ + 0(Mtne)-^), (12) 

are features that we have found to hold for certain other analytic forms of dPi/dpt, notably 
all those with a structure pY^e-i"^'+'^)pt/p- ^f^j- arbitrary positive m). Other characteristics, 
such as the particular coefficient of the l/(z^yltiie) offset in eq. ()12p . or the analytic structure 
close to the turn-on, do depend on the form taken for dPi/dpt- 

The determination of p from the median of jets' pt densities differs from the above "tiled" 
model in that jets do not have a fixed area. There is no simple way of extending the analytical 
model so as to account for this, however one can study the impact of the question numerically, 
by examining toy events in which many soft particles have been generated according to eq. (H]) , 
with random positions in the y — 4> plane. On each event one runs the jet-based procedure 
to determine pext and compares it to pext for the tile-based procedure and to the analytic 
approximation eq. (jlip . We assume that tiles of ^tUe are comparable to jets of the same 
average area, ^tiie = (^jet)- One might think that the average jet area should be given by 
the result for ghost jets in Ref. [12], (^ghost-jet) — O.SSvri?^. However, this only holds in the 
limit of very dense UE; for the typical kinds of configuration that are of interest to us, it will 
be more appropriate to use 

(^jct) ^ cjR^ , cj ~ 0.65^ ~ 2.04 , (13) 

where we have defined a constant cj that will reappear in several places below. 

Given this relation between the typical jet area and radius R, we can deduce the critical 
radius, .Rcrit, below which p is zero, 



i?cnt - - • a/^ - 0.41 ^ . (14) 
p \ 2cj p 



We can also rewrite eq. ()12p in terms of R and a/p or R, 



■cut • 



"""" +ofT4)=-:^|f + of^l^ (15) 



p Ap^cjR'^ KP^R^J 21n2 i?2 ' ^4 

The above results depend on the specific form of toy model that one chooses. To estimate 
the importance of this model dependence, one can replace dPi/dpt as given in eq. @ with 
the alternative form 

1 ^P. n n I 



Pi dpt 

as discussed in more detail in appendix |Al The essential relations for this model are a/p = 
vWW, ^crit = ^V(21n2)/(3cj) ~ 0.48^ and 

2.= ,o(j^]^-^^^,Om. (17) 



9p2cji?2 VP^^V 31n2 i?2 ' y ji4 
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Figure 3: Comparison of jet and tile median-based determinations of p for two toy soft-event 
models (as described in the text), together with the corresponding analytical approximations. 
The mean number of particles per unit area, v, was 5 (commensurate with typical expectations 
for the UE at LHC) and the jets that were used had \y\ < it (this choice stems from our use 
of square tiles). To achieve sub-percent- level agreement between the analytical and jet-based 
results at larger R values, it was necessary to carry out the determination of pext using pt- 
scheme recombination of 4-momenta in the jets. 

These results involve the same analytic structures as for the original form of the toy model, 
with numerical coefficients that imply slightly smaller corrections for finite R. 

In fig. [3l the approximate analytical toy model results as a function of R (or the equivalent 
^tiie) are compared to the average results obtained by applying the "tiled" median approach 
as well as the jet-area median approach to toy-model configurations. This is done for both 
variants of the toy model. There is near perfect agreement between the analytical approxi- 
mations and the average median tile-based results. This is indicative both of the quality of 
the analytical approximation and of the limited impact of the practical use of finite Atot (here 
47r^), as opposed to the large Atot limit that went into the analytical results. The median 
jet-based results are rather close to the tile-based results for larger R valuesjl though the 
precise shape in the turn-on region differs a little with respect to the tile-based expectation. 
Moderate differences exist between the results with the two choices for dPi/dpt, and these 
can be taken as indicative of the magnitude of the model-dependence in the above analysis. 

One message to take from fig. [3] is that for an R value that is twice that where the turn-on 
occurs, pext underestimates p by about 10 — 20%. This can be kept in mind as a ballpark 
value for the accuracies that we will be able to achieve and can be compared to the 20 — 30% 
effect discussed above for the TransMin p determination in the traditional approach. 

^To get this level of agreement for _R ~ 1, it turned out to be necessary to use the pt recombination scheme 
in determining the jet momenta. 



, . , . 1 . , . , . 

: V = 5, exponential distr., |y|<7r 




/ y,' analytic approx. : 

/ // median tile p o 
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3.2 Purely perturbative events 



In our definition of purely perturbative events there is no underlying event and the only 
regions of the event that have non-zero momentum are those that contain a perturbative 
emission. Nevertheless UE-determination methods may still give a non-zero result for their 
estimate of the UE energy density, because of the way they are affected by those extra jets. 

We work here with the assumption that we have selected events with at least two hard jets 
(or with aW or Z boson), and that extra jets may be present at higher perturbative orders. 
A crude, but illustrative approximation for those higher orders will be obtained as follows. 
We will take the dominant source of emissions to be radiation from the initial-state partons 
that enter the hard reaction0 Furthermore we will assume that those emissions are soft, 
distributed uniformly in rapidity and azimuth, and with a pt spectrum given by as{pt)/pt- 

dn asijpt) ^^g^ 



dptdydcj) vr2 p^ 

where Ci is the colour factor associated with the incoming partons (Cj = Ca = 3 for gluons, 
Ci = Cp = ^ for quarks). For a dijet event whose two hard jets have transverse momenta 
Pt,hard) wc will take eq. (fTHj) to be valid independently of rapidity for Qo < pt < Q, where 
Q ~ ^Pt.hard and Qo ~ IGeVl We will also assume that emissions are independent. Thus, 
the probability distribution for the number of emissions will be a Poisson distribution whose 
mean is obtained by the integral of eq. (jlSp over the relevant phase space. 



Traditional approach. The average pt densities in the Average and the Max regions will 
both receive contributions from the emission of one gluon (relative to the Born diagram for 
the process). In contrast, the Min region only receives a contribution when at least two gluons 
have been emitted. One can obtain the Pext,Av value just by integrating eq. (fTSj) up to Q, 
which we do in a fixed coupling approximation, Os = as{Q), since the integral is dominated 
by values of pt ^ Q: 

(Pext.Av) = TT-p / Ptdpt I dydcp = . (19) 

2ATrans Jo ^2ATrans dptdydcp TT^ 

Here we have neglected the (small) impact of Qq and, in the fixed-coupling approximation, 
the result is complete to all orders in as ■ One feature to note about the result is that it scales 
with Q. 

To determine (/9ext,Min) to O [a^), we assume that the left and right transverse regions 
each contain one gluon, and that the left-hand gluon (L) is harder than the right-hand one 
(R); (/Oext.Min) is then given by the average transverse momentum in the right-hand region, 

, \ 1 f Cias ATr^nsV f^dpt.L r^'"^ dpt.R ^ f Qa 

(Pext,Min = 2- 2 / " / —Pt,R = 2 3 

^Trans V J Jo Pt,L Jo Pt,R \ 

''ignoring radiation from any outgoing (Born) partons is not too poor an approximation, because a significant 
part of that radiation will be contained within the corresponding jets. 

^Based on coUinear factorisation, one would expect that the upper limit on the pt of emissions to have 
significant rapidity dependence. For example, if the hard process takes place at central rapidities, then one 
might write pt < Pt.hardfi"'"' . The rapidity-independent approximation is instead inspired by a high-energy 
factorisation picture, relevant when pt <C v^e"'"'. Studies with Iferwig at parton- level (based on coUinear 
factorisation) give a distribution for the upper pt limit on extra jets that is intermediate between these two 
expressions. 



j ^TransQ, (20) 
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with an additional factor of 2 to account for the case where the right-hand gluon is the harder 
one. The result is proportional to a^, i.e. suppressed by an extra factor of Os compared to 
Pext,Av, however it is enhanced by a factor of Axrans- 

Finally we can estimate (pext,Max) using the relation Pext,Min + /3ext,Max = 2pext,Av, giving 

us 

C (y. { C ' ol \ ^ 

(Pext.Max) = 2 Q ~ ^ ( ) ^'^^^^^ ^ ' ^^^^ 

To appreciate the impact of the various terms, let us take Cj = Ca = 3, Q = 50 GeV, 
as{Q) = 0.13, and ^Trans = 27r/3. Then we obtain 

(/5cxt,Av)^2.0GeV, (22a) 
(Pext.Min) ^0.3GeV, (22b) 
(/?ext,Max)^3.6GeV. (22c) 

These numbers scale roughly linearly with Q. The crudeness of our approximations for the 
perturbative part of the event means that they are not be trusted to better than within a 
factor of two (worse in the case of (/9ext,Min))- However the rough orders of magnitude are 
still instructive and highlight the advantage of the Min region. 

The above analytic estimates can be verified by using more realistic events from a Monte 
Carlo generator at parton level. For the case of dijets from Herwig pp collisions at y/s = 
lOTeV, with pt of the partons in the hard 2— 7>2 process required to be above 100 GeV (con- 
sistent with Q = 50 GeV) and the soft underlying event turned off, one obtains 



(PexU)=^2.1GeV, (23a) 
(Pe^^Min)=^0.5GeV, (23b) 
(/^^^Max)=^3.8GeV. (23c) 

These numbers are very close to those from the simple model of purely perturbative underlying 
event described above for gluon jets (though Herwig has an admixture of quark jets here). 
We have verified that if we double the area over which the measurement is carried out, the 
(P^?Min) result roughly doubles, as expected from eq. (f20|) . 

One comment concerning the above results is that in the pure soft UE case it was /5cxt,Av 
that was the least biased estimate of p. Here it is Pext,Min that is the least biased by hard 
perturbative radiation. If one restricts one's attention to /Oext.Min, then a further property of 
interest is that in the soft UE case, the bias is reduced by increasing the transverse region's area 
^Trans5 while for hard perturbative contamination increasing j4Trans increases the bias. This 
trade-off between the two issues is characteristic of the difficulty of accurately estimating p. 



Area/median-based approach Let us suppose that we extract p based on jets contained 
in a region of area A^ot- Assuming the typical area for jets as introduced in eq. (|13p . then the 
typical number of jets N in the region (including ghost jets) should be given by 

The exact value of N in each given event will depend on that event's detailed structure (and 
the exact set of ghosts), but eq. ()24p should be adequate for our illustrative discussion here. 
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Of the N jets, we will assume that are "hard" jets, of which nh correspond to the 
final-state born particles (n;, = 2 for dijet events, rif, = for Drell-Yan events) and Up stem 
from perturbative radiation. It is convenient, albeit somewhat simplistic, to model Up as 
being given by the number of emitted gluon^ 

(uh) = nfe + (up) ':^nb + ^ [ d4>dy [ —as{pt) = rn, + Atot ^TjT- In , (25) 

^ Mot Jqo Pt ^ 26o as{Q) 

where we consider the number of perturbative emissions between some non-perturbative scale 
Qo and an upper limit Q related to the hard scale of the process (e.g. half the pt of the hardest 
jet, as before) and we have used a 1-loop running approximation for as{pt)- 

For the median estimator of p to be non-zero, at least half the jets should contain pertur- 
bative radiation, i.e. Up + n^, > N/2. Since the number of primary emissions follows a Poisson 
distribution, we get the probability of non-zero p from the following sum 

A (np)"e-<"^) (np)W2-"^)e-W) 
^ ^ n! - {N/2-n,)\ ' ^ 

where we have also made the approximation that the sum is dominated by its first term, on 
the grounds that {up) /{N /2 — rih) <C 1. Given P, one can estimate pext by observing that the 
(A^/2)*'^ jet will be the softest of all the perturbative jets, and therefore have pt ~ Qo, giving 

This is plotted with thick lines in fig. HI (left) as a function of ^tot for R = 0.6, using a 1-loop 
5-flavour coupling with A = 0.1 GeVlJand with Q = = 3, Q = 50 GeV, Qo = 1 GeV and 
two values for rife, and 2. 

To estimate the uncertainties of our analytic formula introduced by approximations ()26p 
and ()27p we also plot with thin lines in fig. [J] (left) the result of numerical studies of the 
same simple set of perturbative emissions (equivalent to the full sum over n). As before, 
we associate each parton (i.e. gluon in our case) with one jet and assume that this jet has 
area cjE?. In the case ni, = 2, we make sure that the two Born particles are always present 
in the region where the underlying event is measured. One sees in fig. H] that the exact 
numerical results (equivalent to taking the full sum in eq. (|26p ) are moderately higher than 
the corresponding analytic approximations, as would be expected. In both cases, however, 
the contribution to pext is negligible except for small values of Atot with rib = 2. 

The approximation that we have used for the distribution of the number of jets is rather 
simplistic. To estimate the order of magnitude of the uncertainties that are present in our toy 
model of the perturbatively induced "UE" , notably those associated with secondary radiation 
from the partons, we have also applied the area/median method to realistic dijet events from 
Herwig with soft UE turned off, taken at parton level. To make the comparison, we required 



^Recall that our model includes only primary emissions from the incoming partons — we neglect cases 
where two of these emissions end up in the same jet (which would decrease Up), and also the fragmentation 
contribution from the Born and radiated particles (mostly contained within the respective jets, but which 
would increase Up in the cases where the fragmentation is outside those jets). 

^This mimics a 2-loop coupling with as{Mz) = 0.120 to within a couple of percent over a broad range of 
scales. 
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Figure 4: Left: pure perturbative underlying event extraction via the area/median method, as 
a function of the total area, for cases with zero and two Born particles. The curves correspond 
to the simple model of perturbative emissions described in the text. Thick lines show the 
analytic estimate of pext based on the RHS of eq. ()26p and thin lines correspond to the use of 
the median on our model for the ensemble of parton emissions (see text for further details). 
Right: comparison of the analytical prediction for the case with 2 Born particles with results 
from parton- level (no UE) events generated with Herwig, and clustered with C/A algorithm 
(points). Two analytical results are shown: one with (up) as given in eq. ()25p (1 jet/parton) 
and another in which {up) — t- 1.3 (up) + 0.3nf,, i.e. each parton generates an average of 1.3 jets. 
The figure also shows a curve (analytical) and points (Herwig) that illustrate the impact of 
removing the two hardest jets from the list used in determining the median. 



that the partons in the hard 2— )-2 process have pt > 100 GeV, as before (and consistent with 
our choice Q = 50 GeV). The result is shown in fig. H] (right) with black and open circles for 
the cases with and without removal of the two hardest jets from the ensemble used for the 
median. The ^tot -dependence of the Herwig results differs somewhat from the expectations 
in eqs. (|26p and (j27p . We attribute this to the fact that each parton (be it a Born parton or 
a primary emission) can itself radiate extra gluons, some of which will lead to additional jets. 
A simple way of accounting for this is to replace (up) in eq. ([26]) with (1 + X) {up) + Xrib, 
where X is the number of extra jets obtained per parton|f| Using a modest X = 0.3 (1.3 jets 
per parton) brings the analytical result into accord with the Herwig results. 

In terms of the practical impact of perturbative emission on the extracted p values, one 
sees that for j4tot ^ 47r (the minimal value that we shall use in section |3|) it remains a small 
effect, though the curves also highlight that at small total area Atotj the area/median method 
can start to become sensitive to perturbative radiation, especially when Born partons are 
present. 

A final point to comment on is the relation between these results and the discussion in [13j . 
where it was argued that pext would be non-zero starting only at O (af^^^ "''). Examining 
eq. ()26p . one sees that this statement stems from the fact that (up) ~ ag (in a fixed-coupling 

*A more sophisticated approach would be to calculate X using soft-gluon resummation, which would lead 
to a single-logarithmic dependence of X on the parton transverse momentum. 
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approximation). However, each power of ag is compensated in part by a power oi AlnQ/Qo 
(which is large), and ultimately the small value of P in eq. (j26|) (and hence (pext)) cannot 
be solely attributed to an ag power-counting argument, but rather involves a more subtle 
interplay of all the factors in eq. ([27|) . 



3.3 Two-component events 

Realistic events are neither purely perturbative nor consist of pure soft noise. It is instructive 
to examine what happens if one considers events that have both components together. 



Traditional approach. The transverse-momentum density extracted from the average of 
the two transverse regions is straightforward to calculate in the two component model: it is 
just given by the sum of the soft and perturbative components. 



/ \ , C'iO^ 

(Pext,Av) = p-\ 5-Q • 



(28) 



The results for the Min and Max regions are more complex: in the pure soft-component case, 
it was the soft radiation that determined which of the two regions was Min/Max; analogously, 
in the perturbative case, it was the perturbative radiation that determined this. When both 
can be present one has to consider which of the two fixes the Min/Max regions. It is useful 
to define V to be the fraction of events in which the amount of perturbative radiation in each 
of the two transverse regions is smaller than the size of soft fluctuations of those regions, 
Pt,L,Pt,R ^ (T-v/^Trans- In this Set of events, it is the soft component that defines which 
region is Min/Max, and the bias in the extraction of p is just the soft bias, eqs. ([9]), with no 
perturbative bias. In the remaining events, in which one or more perturbative emissions are 
much harder than cr-v/^Transj it is those perturbative emissions that will determine which of 
the two regions is Min/Max. For these events, there will be no bias in the contribution from 
the soft component. This implies that the average bias in the Min/Max regions for the soft 
component will be ■ a/^/^TA^^^^. 

As concerns the perturbative contamination, the average results in eqs. (j20p and (|2ip are 
already dominated by the set of events in which there is at least one hard emission, so these 
contributions remain unchanged in the two-component case. The final result for the Min/Max 
regions is therefore 



(Pext.Min) - P 
(Pext,Max) - P ' 

where V is given by 



+ ^ I — I ^Trans V : 



\/ TT^Trans 

aV 



+ 2 



-Q-2 



~ exp ( -2^Trans l^^^"^^ 



a/ TT ^Trans 

Ci 1 Q;s(max((5o, CrV^Xrans)) 



^Trans Q ■ 



asiQ) 



(29a) 
(29b) 

(30) 



Note that the choice of lower scale in the logarithm, uV^Trans) or Qq if that is larger, is only 
controlled to within a factor of O (1), just as is the choice of Q for the upper limit. 
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Area/median-based approach. The combination of low-pt and perturbative components 
is non-trivial also in the case of the area/median-based approach. To treat it analytically, 
it will be convenient to work at R values that are sufficiently large that the distribution of 
Ptjet/^jet for the many jets can be considered approximately Gaussian — i.e. we will work 
away from the turn-on region I'Ajet = In 2 that was discussed in section 13.11 

Of the N jets that are used in determining the median, some will contain hard pertur- 
bative radiation with transverse momentum significantly above the scale of the fluctuations 
of the UE. Assuming that there are on average (uh) hard partons, and that the probability 
distribution of hard partons in a jet (or tile) is given by a Poisson distribution with mean 
(n/i) /N, then the average number of jets not contaminated by the hard partons will be given 
by Ne~^^^''^^ ~ A — {uh) + (n/i)^ /(2A). These uncontaminated jets will have a distribu- 
tion of values of pjet = Pt,jet/^jet that is governed just by the soft component and is roughly 
Gaussian 



Assuming {uh) < A In 2, the median procedure implies finding pext such that A/2 of the 
]\[Q-{nh)/N Gaussian-distribution jets have pjet < Pext, i-e. one must determine the value of 
Pext such that 

r-Pcxt (in('^°ft) A , , 

apjot-i = (32) 

(the unphysical negative lower limit of the integral, an artefact of the Gaussian approximation, 
doesn't perturb the argument). In the small (n/i) /A limit, this is easily solved and gives 

where (/Ogxt^*'') is the result obtained in the pure soft case, eq. (fTT|l of section 

One can 

then use eqs. (fT3]l and (|2l]l to express (Ajet) and A in terms of R and Atot- Keeping only the 
first two terms in the R expansion gives 

,34, 

Features to note here are that the discrepancy is proportional to aR, with the next correction 
going as aR^ . Eq. (|25p provides a result for the average number of hard emissions (n/i), and 
Qq there should be replaced with max(Qo)C' ((Ty^Ajet)), or equivalently max((5o, ^/cjcri?), 
because perturbative emissions whose transverse momentum is much smaller than the scale 
of fluctuations of the underlying event will not bias the median. This then gives us 

{rih) m d L as{m.ax{Qo, y/cjuR)) 

We see that the Born particles contribute when ^tot is not very large. Perturbative emissions 
always contribute, essentially because the number of emissions scales with the total area. 



^The additivity of soft and hard results is an approximation, justified only when (p^xt^*'') is close to p. An 
additional point is that when plotting soft+hard results for (pext), we will eliminate the 0-function in eq. Hll|l 
and use the prescription that (pcxt) is well-defined only when the sum of soft and hard contributions gives a 
positive result. 
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Substituting physically reasonable numbers into eq. (j35p . i.e. which corresponds to setting 
the logarithm L equal to 1, gives 

<=i>=.-pL + o,25^, (36) 
which then gives the following numerical result for the bias, 

(Pext) - (p?.f V0.45ai? • +4.0-^) +0.11ai?3 • + 4.0-^)' . (37) 

Ignoring the rib/Atot terms, for R = 0.6 in gluon-initiated processes (Cj = Ca), the bias 
introduced in p is about 0.29cr. For values of cr = 0.5/3 — 0.7/9, as we will obtain from the 
Monte Carlo study in section HI the positive bias due to these perturbative effects is in the 
same 15 — 20% ballpark as the negative bias due to finite-particle density that was discussed in 
section [3TT] for pure soft events. There is an expectation that these two sources of bias should 
combine linearly, at least when R is sufficiently far above the critical turn-on point that the 
Gaussian approximation used above is valid. Since they have different -R-dependences, ~ +R 
and ~ —1/R^ respectively, there exists only a limited range of R in which they compensate 
each other. In this respect the numbers given above tend to confirm the choice R ~ 0.5 — 0.6 
that had originally been recommended based on Monte Carlo studies in [13] 

Fig. [S] summarises the study of the two component model by showing the analytic results 
from eqs. ()34p and ()36p together with the numerical results from the median tile-based and 
median jet-based approaches. Two sets of results are presented, corresponding to the events 
with and without Born particles. We see the very good agreement between the analytic and 
the numerical tile-based approaches and the median jet-based result (except in the threshold 
region, as in section [3T]) . To achieve this it was essential to include the term ~ R? in eq. ()34p . 

Fig. [5] suggests that the region of R with the least bias for the determination of p is 
R = 0.4 — 0.6. If one requires that the biases in eqs. (fT5]) and (fMj) cancel each other, 
then one finds that R should be chosen proportional to 

1/1/6 or equivalently proportional to 
{(j/pY^^ ~ -^crit- Ignoring the n^/Atot component and the R^ term in the equations of this 
section, one finds 

where the numerical values have been obtained setting L = 1 in eq. (|35p . The result for 
i?zero-bias Can be Seen to be consistent with fig. [6l which shows the analytical approximation 
for pext as a function of R for a broad range of particle densities v. For a variation in the 



An alternative method of extraction of p comes to mind at this point, fitting a formula motivated by the 
results of this section and appendix 1X1 

Poxt=p( ^1 (1 + ci?), (38) 

with the fit parameters p, Rait, n, c. One might choose to forgo n, or try a finite number of choices, e.g. 
n = 1, 2. We have found that such a procedure eliminates a substantial part of the biases in the extraction 
of p for some events, but in others statistical fluctuations lead to poor fits, with results for p that are highly 
skewed towards low values. For this reason, we do not adopt this approach for the current study. 
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Figure 5: Underlying event from the two component toy model analysed in the range of 
rapidity \y\ < n and the full span of the azimuthal angle. The results for the average pext 
(normalised to p), extracted in tile and jet median-based approach (see text) are shown for two 
sets of events. The first set contains soft and perturbative particles while in the second set, 
two Born particles with pt^hard = 100 GeV are present in addition. For reference, the analytic 
curve from fig. [3] for the pure soft case is also shown. The mean number of soft particles per 
unit area, z^, was 5 and their average transverse momentum p, = 0.4 GeV, which corresponds 
to /9 = 2 GeV. The perturbative emissions are distributed between scales Qq = 1 GeV and 
g = 50 GeV. 

particle density (and p) by a factor of 50 {a/phy a factor of 7), the zero-bias R value changes 
only moderately and in close accord with the expectations of eq. ([39]) . Fig. [6] also illustrates 
that a fixed R ~ 0.6 leads to a pext value to within 20% of the correct result for a whole 
range of ly, with the relative impact of the biases steadily decreasing as the particle density 
is increased. 

3.4 Fluctuations in the estimation of p 

In the simple model studies discussed here, the same p value has been used to generate 
all events. Nevertheless, extracted values of p vary from one event to another. This is 
because any method of measuring UE can use only a limited part of an event (restricted 
^tot; ^Transi according to y and (p cuts) and works with a finite number of objects (particles, 
jets). The magnitude of the observed event-to-event fluctuations in this case is an important 
characteristic of a method, because it sets a lower limit on the uncertainty of the event-to- 
event p measurement. These intrinsic fluctuations also affect the measurement of the true 
fluctuations and correlations of realistic UE models. Therefore, in order to measure properties 
of the underlying event that can then be used for efficient subtraction or tuning of simulation 
programs, one is interested in reducing the fluctuations that come with the method itself. 

For the soft underlying event from section 13.11 ignoring the small systematic biases that 
we found there in the determination of p, one can show that the standard deviations of 
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Figure 6: The analytical approximation for pext as a function of R, eqs. (|ll|34p . shown for 
a range of soft-particle densities (z/ values), for the gluonic case, = Ca- The curves from 
left to right (at small R values) correspond to u = 100, 50, 20, 10, 5, 2 (or equivalently p from 
40GeV to 0.8 GeV). 



the extracted p values read 
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The result for the median case is derived in appendix IB.ll The formula for the Min/Max 
regions was obtained in the Gaussian approximation, valid in the limit i^^Trans ^ 1- Substi- 
tuting realistic values for the areas, i.e. ^Trans = 27r/3, ^tot = ^vr, and the density u = 5, one 
arrives at the following numerical estimates 

S^IZ/P = 0.22, 5;:fVp = 0.31, ^Si/Max//' = (41) 

The lower expected fluctuations for the area/median based approach are a consequence of 
the larger area used in the UE determination. The use of a larger area is possible in the first 
place because the method's dynamical separation of UE limits the need to cut away regions 
from the y — (p plane to reduce contamination from hard jets. 

In fig. [71 we show histograms of p extracted in the traditional and area/median approaches 
for the case of purely soft underlying event from section [3TT] (left) and for the combined model 
described in section 13.31 with the soft and perturbative components and 2 Born particles 
(right). The corresponding standard deviations of the pext values are given in table [TJ For 
the case of the purely soft UE, we see that these standard deviations follow the pattern from 
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Figure 7: Distributions of pext from traditional and area/median based approach for the case of 
pure soft underlying event from section ISTTl (left) and for the combined model from section [331 
with the soft and perturbative components and 2 Born particles (right). The average density 
of soft particles is v = 5 and their average transverse momentum is ^ = 0.4 GeV, which 
corresponds to p = 2 GeV. The perturbative emissions are distributed between scales Qq = 
1 GeV and Q = 50 GeV. To guarantee that the two hardest jets are associated with the 
Born particles, their transverse momenta were set to pt = 400 GeV. The jet clustering was 
performed with the Cambridge/ Aachen algorithm with R=0.6. 







Sd,cxt/P 


pure soft 


soft + PT + 2 born 


median/area, all jets 


0.21 


0.40 


median/area, all-2 jets 




0.33 


traditional, TransAv 


0.30 


1.81 


traditional, TransMin 


0.29 


0.90 



Table 1: Results for Sd^evit/ P for various methods of extracting p in the toy model where the 
input standard deviation of p is 5^ = 0. These values provide an indication of the lower 
bound of Sd values that could be observed in real events. 
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eq. (j4ip . with the area/median approach performing best. Note however that finite-density (u) 
effects do tend to shghtly reduce the standard deviation results as compared to expectations, 
especiahy in the TransMin case. 

Adding perturbative and Born particles increases fluctuations in all cases. From the 
shapes of the curves in fig. [7] (right), it seems that the area/median and TransMin results are 
both degraded by similar amounts, while the TransAv result suffers significantly more, with 
a long tail to large pext values. The strong degradation of the TransAv result is an expected 
consequence of its sensitivity to perturbative radiation, 

where one observes the dependence on a^/^Trans (to be compared to the bias on the mean 
which goes as as)- 

However, if one examines the results for Sd in table [H one sees that the TransMin standard 
deviation is also significantly increased by perturbative radiation. The toy-model expectation 
is 

^S-^Q(l + 0(«?<a„s)), (43) 

and for the particular parameters used in table [H the result for Sd turns out to be as large 
as p itself. Compared to the O (a^^) suppression for the bias to the (pext.Min) result, here 
the perturbative radiation bias is much stronger, O (a^). The physical explanation is simple: 
while it is relatively rare for perturbative radiation to affect the TransMin region (hence the 
acceptable peak-region of the pext distribution), when it does, the effect on pext is large, 
contribution significantly to the final result for Sd^ 

The area/median approach is much more robust in this respect, because the hard emis- 
sions' contribution to the standard deviation does not have significant enhancements com- 
pared to the average bias on p: 

as derived in appendix IB.2I In particular, the larger numerical coefficient compared to the 
O (R) term of eq. ([37|l is compensated by the factor of Atot • This good behaviour is 
visible in fig. [71 and also in the values of table [U which are roughly consistent with the above 
analytical estimate (maybe 20% higher). They also highlight the further improvement to 
be had with the hard-jet removal procedure discussed at the end of section [2?2] ("all-2 jets" 
result), which benefits not just Sd, but also the peak position and height in fig. El 

Overall, the results of this section suggest that for any measurement of fluctuations of 
the UE, it will be preferable to use the area/median method, with hard-jet removal able to 
provide some extra benefit. 

^""^The hard component also has an impact on the soft fluctuations, in analogy with the effect discussed for 
(pext,Min) in sectiou |3^ We ignore this complication here since the fluctuations of the hard component in any 
case dominate over those of the soft component. 
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Figure 8: Left: extracted value of a with the area/median approach in the pure soft and 
the two component toy models, shown as a function of R and normalised to the correct 
value for the soft component fisoft = /Osoft\/2/i/. Right: the same set of results, but showing 
(<7ext) / {Pext) normalised to the expected a/p for the soft component. The toy model variant 
is that corresponding to eq. (jH). 



3.5 Extraction of a 

The measurement of intra-event fluctuations, a, has only been discussed so far in the context 
of the area/median approach^ Though we shall not go into full analytical detail, it is easy 
to convince oneself that many of the considerations that arise in the extraction of p apply 
also when determining a. In particular, for pure soft events one underestimates a when R is 
too small and the presence of perturbative radiation will bias the extracted a at larger R. 

One point to be aware of is that our method for extracting it, cf. section 12. 2^ has the 
characteristic that a / y^^jet never exceeds p. This translates into an i?-dependent upper 
bound on aext/Pext, 

^<^jR^lA?>R. (45) 

Pext 

Fig. [8] (left) shows the average Uext as a function of R in the toy model (z/ = 5, with and 
without the perturbative radiation and the Born jets), normalised to the correct result for 
the soft component. In the case of just the soft component, one sees a threshold region 
followed by a slow approach towards the correct value, much as for pext even if the detailed 
shapes differ (in part owing to the one-sided determination of o", which causes the residual 
bias to be proportional to rather than 1/R'^ for the bias to p). With the inclusion of 
the perturbative component there is an additional bias, which grows towards large i?, again 
much as happens for jOgxt- 

To compare the biases on cjext and pext) it is convenient to examine the ratio {(Text) / {Pext), 
fig. [8] (right), normalised to the soft-component result for a/p. The impact of the bound on 

^^In the traditional approach, one might envision calculating the perturbative contributions in eqs. P8I29|) 
to NLO with a program such as NLOJetH — h [33] and then fitting for (p) and (a) in those equations after 
averaging over many events. It seems that it would be difficult, however, to extract event-by-event estimates 
of a. 
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(j//0, eq. ([15]) . is clearly visible up to ~ 0.5 (consistent, roughly, with a / p ^ 0.63). Beyond 
this point, over a reasonable range of R, (cJext) / (Pext) remains compatible with the true value 
to within roughly 10 — 20%, and, as expected, deviations are larger in the presence of hard 
radiation. Overall, in the region of R that is suitable for extracting p, figure [5] suggests that 
the extraction of a should also be quite acceptable. 

3.6 Summary of main results 

Table [2] summarises the main results of this section for the biases, 5p, and the event-to-event 
fluctuations, S^, that occur within the toy model in extracting p, the transverse momentum 
flow per unit area, with each of three UE estimation methods: the traditional approach in 
its TransAv and TransMin variants, whose main parameter is the area Axrans of each of the 
two transverse regions; and the area/median approach whose parameters are the total area 
under consideration, A^ot and the jet radius R. Results are given both for the biases and 
fluctuations intrinsic to the soft component and for the additional biases that arise due to the 
presence of hard radiation in the event, expressed as 

p^^, =p + + ^ ^(soft) ^ ^(hard) _ ^^g^ 

The analytical formulae help illustrate the dependence on the parameters of the measurement 
methods (the area of the regions used, the jet radius) and the physical scales present in the 
events (hard scale Q, and the values for p and for the level of intra-event UE fluctuations, a). 
One sees, for example, how the hard biases and event-to-event fluctuations in the traditional 
approach are always proportional to Q and to some power of a^, whereas in the area/median 
approach they are proportional to a (with modest coefficients and weak additional InlnQ 
dependence contained in the parameter L). 

Table [2] also gives numerical results for the biases and Sd values, based on the default 
set of measurement parameter choices and physical scales that have been used throughout 
this section. This helps illustrate the expected orders of magnitude of different effects under 
realistic conditions. The large results for s'^^'^^'^^ in the traditional method (i.e. unreliable 
event-by-event extraction of /?), together with the proportionality to Q of the traditional 
method's biases and fiuctuations and the fact that it offers no easy way of determining o", 
lead us to prefer the area/median method for the Monte Carlo UE measurement studies that 
we will perform below. 

Other results of this section that are not summarised in table [2] include: -Rcriti eq. the 
ii!- value below which the area/median approach gives pext = 0; the upper bound on (Jcxt/Pcxt 
as a function of R in the median-area method, eq. ([15|) : and i?zero-bias) eq. ([5^ . the R value for 
which the soft and hard biases cancel out in the area/median approach. Finally, fig. [5] shows 
the characteristic shape of the -R-dependence for p^xt in the median/area method, while fig. [6] 
helps illustrate how R = 0.6 is a reasonable default choice for a wide range of UE conditions. 

4 Illustration with Monte Carlo events 

Given the area/median method to determine p and a on an event-by-event basis, let us now 
explore what kinds of observables we might construct from them. The choices that we shall 
make are motivated by considerations of how the UE affects jet measurements at hadron 
colliders. For example the UE leads to an average shift in jet energy; it's important that 
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quantity 



method 



result 



numerical value equation(s) 



^^(soft) 



Trans A V 



TransMin 



Area/Med* 



V 



P \/ Vr^Trans 
1 







-0.09 



-0.14 



9b|), ([2911) 



ard) 



TransAv 



TransMin 



Area/Med 



Cjas Q 
vr^ p 

2 

Ci^s \ ^TransQ 



P V 2 V^tot 



7r2 26o 



0.99 



0.16 



0.17 



(USD, mi 

(EQD, (l29iD 



5 



(soft) 



TransAv 



TransMin 



Area/Med 



1 



p V 2^Trans 

a I IT ■ 



1 



p V VrAxrans 



p V 2 A 



tot 



0.31 



0.36 



0.22 



(liObl) 
(liOal) 



5 



(hard) 



TransAv 



TransMin 



Area/Med 



4ATransVr2 /9 



vr- 



'^/2 p 



cji? /27rcj / rif, 
P 



tot 



A 



+ 



tot 



^2 2bn 



1.72 



0.70 



0.19 



Table 2: Summary of main biases and sources of fluctuations for dif- 
ferent UE extraction methods; Q is the hard scale, Qq the IR cut- 
off on perturbative emissions, L = In [as{raax{Qo, \/cja R)) / as{Q)] , V ^ 

exp (^-2ATrans fi^S^^ln [a5(max(Qo,o-\/^Trans))/as(Q)]), cj = 2.04, Ci is the colour 
factor of the incoming partons. Numerical values are given for p = 2GeV, a/p = 0.63, 
Q = 50GeV, Qo = 1 GeV, Q = Ca = S, Ub = 0, R = 0.6, Atot = 47r and ^Trans = 27r/3 (the 
area of a single transverse region), corresponding to L ~ 1 and V ~ 0.35. The result marked 
with a * is specific to the form of the soft toy model discussed in section 13.11 
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one knows that shift as a function of rapidity, and hence one should determine {p{y)). The 
UE also affects jet energy resolution, through a term of the form (T^J~A^]^. Thus we will also 
want to look at {a) as a function of rapidity. A second way in which the UE affects jet 
energy resolution is that p itself is different event by event, so one might therefore examine 
its event-by-event distribution and its standard deviation Sd- The different ways in which UE 
affects jets was summarised in [I2] with the following equation for the variance of the change 
in jet pt due to the underlying event (neglecting back-reaction): 

(Ap2^.) - (Api,,)2 ^ + (AjA,ij) (^2) + (^JA,R>25J , (47) 

where (Aja,/?) is the average jet area and (Sj^ ^) ~ is the variance of the jet area. One 
sees that each of the terms involves a different characteristic of the UE: (p), (o") and S'J. 

Measurements of UE characteristics, as well as being of direct relevance to jet measure- 
ments, also have the power to constrain UE models. This has been the motivation for most 
UE studies to date and we believe that the range of UE characteristics discussed above would 
complement existing types of measurement and so provide an additional powerful set of con- 
straints. Furthermore, as well as examining "local" quantities, such as {p{y)), one can also, 
for example, ask the question of whether there are long-range correlations between the magni- 
tude of the UE in different parts of an event. In (prevalent) UE models that involve multiple 
independent 2 — )• 2 scatterings, one might expect these correlations to be modest, whereas in 
a BFKL-inspired model (as might derive from work like Ref. jl6j). where one or more gluon 
ladders stretch across a whole event, one might expect them to be larger. 

So far in this article we have treated the traditional and area/median based UE mea- 
surement approaches on a similar footing. The results of section [3] suggest that both provide 
reasonably adequate information about average characteristics of the underlying event: both 
methods introduced biases and different sources of biases cancelled partially, limiting their 
overall impact (though in the area/median approach the biases had much weaker dependence 
on the hard scale of the event). In contrast, as we saw in section [33] and its fig. [3 in the tra- 
ditional approach the fluctuations due to hard perturbative emission were likely to dominate 
over fluctuations in the soft component of the UE, therefore for observables that are sensitive 
to fluctuations, we believe that the area/median approach is more robust. For this reason, in 
what follows, we shall concentrate on this second approach. 

This section will be structured as follows: after outlining the set of Monte Carlo simula- 
tions that we use (section |4.1|) and the event selection cuts (section |4.1|) . we examine to what 
extent the toy model is qualitatively similar to the realistic simulations (section 14. 3p . The 
motivation is that the toy model guides our intuition about the measurement procedure, and 
it is important to establish that this intuition is well founded. Having done so we then study 
(section 14. 4p a selection of observables that are relevant for eq. (j47p , including their rapidity 
dependence and also their degree of correlation across different parts of a same event. This set 
of observables (together with some of those in section would, we believe, be interesting to 
examine experimentally, both in terms of the direct information it provides for understanding 
the impact of UE on jets (and isolation, etc.) and in terms of its ability to constrain models. 

4.1 Monte Carlo models used 

We shall examine a series of Monte Carlo UE models: the UE that comes with the "old" 
virtuality-ordered shower in Pythia 6.4 [21], in the DW and DWT tunes by R. Field [18j , 
identical at Tevatron energies, but with different energy-dependences; the UE that comes with 



24 



the "new" transverse-momentum-ordered shower in Pythia 6.4, in the SOA tune [5tl341[35] by 
P. Skands; Herwig 6.5's [551157] default "soft" UE, which fails to reproduce various aspects of 
Tevatron data, but instructive for comparisons between different types of models; Herwig 6.5 
with Jimmy 4.3 [1] in an ATLAS tuncO by Moraes piB]- Ah models are based on multiple 
interactions except for Herwig's soft UE. 

By default we use a pp centre of mass collision energy of ^/s = lOTeV, though we will 
also consider the energy dependence of some of the observables we study. 

A comment is due on the fact that we will carry out our investigations at particle (hadron) 
level. Experiments may carry out measurements on tracks only (well measured, though with 
some low pt threshold), on calorimeter information (subject to noise and noise-suppression 
thresholds) or on some combination of the the two. It is beyond the scope of this article to 
estimate the potentially substantial impact of detector effects on the results presented here. 
Nevertheless, we believe that the differences that we will see between various event-generator 
tunes should persist even after detector effects. 

4.2 Event selection 

We consider dijet events, where the leading jet, reconstructed with the anti-A;^ algorithm [31] 
with R = 0.6 has pt > 100 GeV, the next hardest jet has pt > 80 GeV and both jets are 
required to lie in the rapidity window |y| < 4. Note that since the cross section for jets falls 
steeply, a cut on jet pt introduces a "trigger-bias", i.e. favours events where the UE is slightly 
larger than average P^l The above cut is a default used for the study presented in this section. 
In some places we employ tighter cuts on the rapidities of the two hardest jets in order to 
study the impact of these jets on UE determination. Whenever this is the case, it will be 
indicated explicitly. The choice of the anti-A:t algorithm for event-selection here is motivated 
by the fact that it will be the first to be supported by ATLAS and CMS [39] and therefore 
will have been used by the experiments for their initial event selection. 

For the determination of the properties of the UE within the jet- area/median approach, 
the anti-kt algorithm is not suitable, as discussed in section 12.21 We will therefore use jets 
from the C/A algorithm with R = 0.6 for this task. All jet finding is performed with Fast Jet 
2.4.1 [401141] . and we use the ActiveAreaExplicitGhosts option to calculate areas, because 
this ensures the safest treatment of pure ghost jets. The ghost area that is used is 0.01 and 
ghosts are placed up to \y\ < 9 (we could, however, have used a smaller upper limit for most 
of the plots). The other parameters are left at their default values. 

4.3 Comparisons of characteristics of MC and toy model 

Given the importance of the toy model in guiding our understanding of the measurement of 
the underlying event, let us start by examining whether realistic Monte Carlo events bear 
any similarity to toy- model events. One way of doing this is to examine the iZ-dependence 

^^The non-default parameter setting are: PRSOF=0, JMRAD(73) = 1.8, PTJIM= 2.8(^/1. 8)°-'^'"^ , with 
CTEQ6L1 JSj PDFs (i.e. just the parameters in 

To investigate the impact of the trigger bias, we used the following procedure: we first measured the 
underlying event density p in a rapidity window \Sy\ < 1 around each hard jet, and then placed our hard jet 
cuts on pt,jct — pA-jat rather than on ptjct- The result for (p) in events that passed these cuts on "subtracted" 
jets came out about 10% lower than for events where we cut on unsubtracted jets. Since this is not too large 
an effect, and for reasons of simplicity, we have however chosen not to apply this procedure to our analysis as 
a whole. 
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Figure 9: The left-hand plot shows p{R) extracted with C/A jets in the area/median approach 
for 3 representative Pythia (DWT) events, which passed the selection cuts of section W7I\ The 
right-hand plot shows the corresponding histograms oi pt/A for the same 3 events. 



of the extracted p, which, in the toy model, had a characteristic shape, fig. [5l Fig. [9] (left) 
shows p{R) for three representative Pythia events (DWT tune). In each case one observes the 
turn-on at some threshold R value, followed by a roughly linear slope at larger R, precisely 
as expected. There is substantial variation in the curves from one event to the next, and one 
can trace this back to their distributions of pt/A shown in fig. [9] (right): the blue (dashed) 
line which has small p and little R dependence corresponds to an event in which there is 
no activity at high pt/A values. The green (dotted) curve, which has large p and strong 
i?-dependence, has a correspondingly broad distribution of pt/A values, with much activity 
at intermediate pt values. In this event, the toy-model picture of a clean separation between 
soft and hard physics is somewhat challenged, though the general pattern of p{R) having a 
turn-on followed by linear i?-dependence still holds. 

If we average p{R) over many events we get fig. [10] (left), which shows results for several 
generators/tunes. One observation is that the Pythia tunes have a larger slope: based on the 
toy-model calculations, eq. ()34p . this can be interpreted as meaning that they have a larger 
value for the quantity crn/j, where Uf^ is the number of hard jets. The Herwig default curve, 
with no MPI, has the smallest slope (and p value). The average over many events for (7{R) 
(normalised to {p{R))) is shown in fig. 1101 (right). An interesting feature is the linear rise for 
R = 0.2 — 0.4 (and up to i? = 0.6 for the Pythia tunes). In this region, one saturates the 
bound eq. (j45p . which implies that R is too small for a proper measurement of a. For R > 0.6 
there is a shallower rise, which we interpret as being due to the presence of hard jets, as is 
the case in fig. [8] (though there is less curvature in the MC events than in the toy model) . 

To get an idea of the event-to-event variations of p{R) we use the following procedure: 
given A'' events, for a given R, we sort the events according to p{R). We then define the 
10*^ percentile result for p{R) to be the value of p{R) for event N/10, and similarly for other 
percentiles. Fig. 1111 shows the 10*^, 20*^, etc. percentile results for p{R), as a function of R 
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Figure 10: Average over many events of the extracted p (left) and a (right), shown as function 
of R for four generators/tunes. 



for our 4 generator /tune combinations (together with the average, for comparison)!^ One 
observes the sharp turn-on as a function of R (washed out in the {p{R))). The smaher the 
turn-on point. Rent, the larger than value of p (and the larger the slope). The spread of 
events is noticeably large, both in the values of p and for i?crit) especially, for the latter, in 
the context of Herwig-I- Jimmy. 

Though it is not our intention to highlight figures [TOl and [TT] as main results of this paper, 
we do note that it would be possible to make corresponding experimental measurements, and 
use them as input to MC tunes. 

In the remaining parts of this section we shall concentrate on results extracted with R = 
0.6, which, as anticipated in the toy-model section, seems to offer a reasonable compromise 
between being sufficiently large as to be well beyond the turn-on in most events, while not 
being too severely affected by the rise at large R that is induced by semi-hard radiation in 
the event. 



4.4 Study of selected observables 
4.4.1 Mean energy flow 

Let us start by examining p and its rapidity dependence, fig. [121 for Tevatron and LHC en- 
ergiesJif] The results for (p) in the central rapidity bin for Tevatron (left) should be strongly 
constrained by the standard UE measurements at Tevatron, and this is reflected in the small 
difference between SOA and DW, though there is a somewhat larger difference with the Her- 
wig+Jimmy tune. The rapidity dependence is quite strong, with stronger suppression at 

A subtle point in the production of fig. [TT] is that the event that provides the 10*'^ percentile for (say) 
R = 0.6 is usually not the same event that provides it for R = 0.8. Thus the curves in fig. [TT] are not the 
curves that would be obtained for individual events (these are far less smooth, cf. fig. [SJl, but can be thought 
of as some idealisation of these curves in a world free of fluctuations. 

To simplify the comparisons, we use the cuts of section in both cases, though they involve a rapidity 
range that extends beyond the Tevatron's coverage. 
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Figure 11: lO*'^, 20*^^, etc. percentiles for p{R), function of R for four generator/tunes. 
Also shown is {p{R)) in each case. 



forward rapidities for SOA and Herwig+Jimmy than for the DW tune. One should remember 
in examining the rapidity dependence that there are essentially no experimental constraints 
on the level of the underlying event at forward rapidities — it is therefore a model-dependent 
extrapolation. At LHC energies we see, fig. 1121 (right), that differences appear between mod- 
els also at central rapidities, reflecting an uncertainty in the extrapolation in energy. The 
DWT tune's energy-dependence is disfavoured based on RHIC [l2] and lower-energy Teva- 
tron data [25], but we include it to give an idea of the magnitude of possible differences. 

One question that arises in the measurement of p is the possible bias from hard jets and 
the relation of this bias with the rapidity bin size or equivalently the area Atot in which one 
measures p. In this context, recall that eq. ()37p implies a bias from the Born jets that scales 
as Uh/Atot. In fig. [T2]the hard jets were free to lie anywhere within \y\ < 4 and were included 
in the sample of jets used to obtain the median. In fig. [T3] (left) the red (dark) dashed curve 
is the same as the DW result in fig. [12] (right). The red (dark) solid curve shows the impact 
of removing the 2 hardest jets from the median sample. One sees that this has a rather small 
effect. Next we examine an event sample in which the 2 hardest jets are in the central rapidity 
bin, so that that bin receives all the bias from the Born jets. If those jets are included in the 
set used to calculate the median (dotted green histogram), then the impact on {p{y)) in that 
bin becomes noticeable, O (30%). Removing them from the set for the median (solid green 
curve) brings us almost back to the basic "all-2" result for the full dijet sample. That the 
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Figure 12: The rapidity dependence of the average underlying event activity, {p{y)), shown 
for = 1.96 TeV Tevatron {pp) simulations (left) and ^/s = 10 TeV LHC {pp) simulations 
(right). 
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Figure 13: Dependence of the {p{y)) results on the rapidities of the hard jets and on the 
choice of whether to exclude the two hardest jets from the median procedure. The left-hand 
plot shows this for our standard rapidity-bin size, 5y = 2, corresponding to Atot = 47r, while 
the right-hand plot shows what happens if we choose smaller bins, 6y = 1, i.e.= Atot = 27r. 
The results correspond to the Pythia DW tune. 
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Figure 14: Left: (cr)/(/o) as a function of rapidity. Right: Sd/ {p) as a function of rapidity. 



result is not fully identical is a consequence of the fact that when the two hard jets are central 
there is an increased probability that the 3rd hardest jet will also be central, thus biasing 
very slightly the central-y bin. 

The right-hand plot of fig. [T3] shows what happens if we reduce the rapidity bin size, 
causing p to be measured in regions of area ^tot = 27r rather than A^oi = 47r. Since the 
impact of the Born particles is inversely proportional to Atot, requiring the Born particles 
to be in the central bin has a noticeably larger effect for the smaller rapidity bin size 
Discarding the two hard jets brings us back to a result that is roughly in accord with that for 
the larger bin size. 

The conclusion from fig. [13] is that if one's event selection does not constrain the hard jets 
to be in the same bin as that used for measuring p and if the bin area is sufficiently large, 
^tot ^ 12, then biases from the hard jets are quite small. In what follows, we will normally 
use j4tot = 47r and leave in the hard jets, in order to keep the analysis as simple as possible. 



4.4.2 Fluctuations 

In fig. [m we examine fluctuations, now only for LHC. The left-hand plot shows the size of 
intra-event fluctuations, through the ratio of {(j{y)) to {p{y)), while the right hand plot shows 
inter-event fluctuations, through Sd{y) / {p{y)) ■ Fluctuations have not been as directly tuned 
as energy flow. Despite this the intra-event fluctuations are very similar across all the Pythia 
tunes and almost independent of rapidity when normalised to {p{y)). Herwig-I- Jimmy's intra- 
event fluctuations are somewhat smaller, but do have rapidity dependence. 

Concerning inter-event fluctuations, fig. [U] (right), the two virtuality ordered (DW/DWT) 
Pythia models are again flat, whereas the pt-ordered shower has increasing fluctuations at 
forward rapidities. Herwig is intermediate between the two sets of Pythia results. 

One observation is that whereas DW/DWT have almost identical intra and inter-event 
fluctuations, Herwig's intra-event fluctuations are nearly 40% smaller than the inter-event 

^^The effect is definitely larger than would be expected based on eq. (|34|) . perhaps a refiection of the non- 
Gaussianity of the distribution of ptj/Aj for the events under consideration. 
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Figure 15: Correlation of p{y2) with 
p{yi), shown function of ?/2 for yi in 
a given rapidity bin: (a) — 1 < y < 1, (b) 
< y < 1 and (c) 3 < yi < 5. In plots 
(a) and (c) p has been determined in bins 
of size 6y = 2, while in (b) it has been 
determined in bins of size 6y = 1. 
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fluctuations. This is reflected also when we examine correlations between p in different parts 
of the event, 

corr(y„ y,) = - (/-(^O) (/-(^^)) (48) 

Sd{yi)Sd(y2) 

as shown in fig. [T21 The correlations are noticeably larger for Herwig+ Jimmy than they 
are for all the Pythia tunes. In determining the correlations it was important that we used 
sufficiently large rapidity bins. Comparing the upper-left plot from fig. [15] {6y = 2) and the 
upper-right plot {6y = 1) one sees that the smaller rapidity bins lead to noticeably smaller 
measured correlations. We interpret this as follows: in small rapidity bins, the "statistics" of 
jets for measuring the p value are more limited. This increases the error on the determination 
of p, thus reducing the maximum amount of correlation that can be observed between different 
bins. 

The final quantity that we examine is the event-by-event distribution of p, fig. \W[ for 
a central rapidity bin (left) and a forward rapidity bin (right). Perhaps the most striking 
characteristic of these plots is the very broad nature of these distributions, which are far 
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Figure 16: The event-by-event distribution of p for a selection of generators, in a central 
rapidity bin, \y\ < 1 (left-hand plot) and a forward rapidity bin, 3 < y < 5 (right). 



from being Gaussian distributions of width Sd centred on (p). The right-hand plot also has a 
significant bin at p = — i.e. there is a substantial number of events for which at least half 
of the jets at forward rapidities are pure ghost jets. 

4.4.3 Energy dependence of results 

Fig. [T7] summarises the energy-dependence of the average energy flow (p) (left) and of the 
intra-event fluctuations {a) normalised to {p) (right) in the central rapidity bin \y\ < 1. The 
features of note are that for (p), the Her wig-|- Jimmy Atlas tune has a significantly steeper en- 
ergy dependence than the DW and SOA Pythia tunes, somewhat more like DWT. For (a) / (p), 
the Pythia tunes have almost no energy-dependence, whereas Herwig has substantial energy- 
dependence. We have not shown the energy dependence of Sd/{p) because it is essentially 
independent of ^/s for all generators. 

5 Conclusions 

The distinction between underlying event and the "hard" part of hadron-collider events is ill- 
defined in QCD. Nevertheless, physically, one may picture the UE as being the component of 
a hadron-hadron scattering that fills the event fairly uniformly with low-p^ radiation. A goal 
of this article was to investigate how different UE-measurement strategies fare in separating 
such a low-pt component from the hard part of the event. 

To do so, we developed a simple toy-model for events, with two distinct components, 
one soft, corresponding to the UE, the other perturbative and hard. Within this model it is 
quite straightforward to establish to what extent a given UE measurement strategy correctly 
extracts just the soft component. 

The two UE measurement strategies that we investigated are the "traditional" approach, 
measuring radiation in regions transverse to a leading jet, and the "jet- area/median" ap- 
proach. Both involve strategies to help separate out the soft and hard components: the use of 
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Figure 17: Level of the underlying event (p) (left) and its intra-event fluctuations {(t) / {p) 
(right) in central rapidity rapidity bin \ y\ < 1 as a function of energy of pp collision. The points 
correspond to the energies of RHIC, Tevatron and a set of possible LHC ^/s values. 



the Av/Min/Max transverse regions in the traditional approach, the use of median activity 
rather than average in the jet-area/median approach. One result from the toy model is a 
quantification of how those strategies fare in the extraction of its soft component. A sec- 
ond result is a determination of the nature of the residual effects due to perturbative (hard) 
radiation. These two results could be expressed analytically in terms of the parameters of 
the measurement procedures (transverse-region area, jet radius, providing useful guidance 
in choosing them) and of characteristics of the hard scattering (notably the value of the 
hard scale and the properties of the soft component). We also examined the question of 
event-to-event fluctuations in the extraction of the characteristics of the soft component. Our 
toy-model results are summarised in section 13.61 

Practically, one conclusion from this work is that for determinations of averaged quantities, 
for example the mean transverse- momentum density per unit area, {p), both the TransMin 
and the area/median measurement methods give a fair determination of the soft component, 
as long as the momentum transfer of the hard scattering is not too large (< 100 GeV for 
the LHC; for higher momentum transfers, the TransMin method is affected by a rapidly 
increasing hard contribution). In particular, for the parameter choices used or advocated 
in the literature, the two kinds of bias seen in the toy model, mismeasurement of the soft 
component and contamination from the hard component, tend to partially cancel each other, 
giving a limited overall bias, of order 20%. In contrast, for event-by-event measurements and 
determinations of fluctuations of the soft component, the traditional approach is significantly 
affected by rare "outliers" , cf. table [TJ 

For this reason, in the full Monte Carlo studies of section H] we concentrated on the 
area/median approach. The results included a validation of the main qualitative prediction 
from the toy model, namely the structure of the i?-dependence of the extracted p, section 
Section 14.4! showed a range of possible observables whose measure we advocate at LHC: 
rapidity dependence of the UE, nature of the event-to-event fluctuations, and intra-event 
fluctuations and correlations. Though existing measurements may indirectly constrain some 
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of these features of the UE, we beheve that they are of sufficient practical importance that 
they deserve dedicated measurements, especially as they differ noticeably between various 
Monte Carlo models. 
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A Toy model UE calculations 

A.l Threshold and asymptotic regions 

With the exponential model for the single-particle pt distribution, eq. dH), there are two 
asymptotic limits of interest: the threshold for I'A — )• ln2 and the asymptotic i^A — )• oo region 
(for brevity here we write A = j4tiie). Writing eq. (|1U|) as an expansion around i^A = In 2, one 
obtains 



i(l + ln2-uA) + '^ ^e-p*/M + o{{uA- In 2f) = ^ , (49) 

2 2 Jq fj, 2 

where we have used eq. (|6ap for dP/dpt and kept only the (^-function term and the first term 
of the sum over n. To first order in uA — In 2, this gives us the behaviour in the turn-on 
region, 

u A — In 9 

A {pe.t) = /X + O {{uA - ln2)V) , (50) 

or, equivalently, using /i = p/u ~ pA/ ln2 for uA ~ In 2, 

■^-^ = 5 + O ((i^A - In 2f) . (51 

p ln^2 ^ ' 

The approximation eq. (jlip reproduces the linear dependence on vA — In 2 at threshold, 
though its slope there, d{pext) /divA) = 2, differs slightly from the exact slope of 1/ln^ 2. 

At large uA, by examining the numerical solutions to eq. ()10p . we have determined the 
following relation for the asymptotic behaviour of pext, 

te..)-M-^+0(^)=p-^+0(^). (52) 

This is reproduced by the approximation of eq. (fTTj) . 

An approximation that is closer still to the full result, with the correct coefficients in both 
limits, is 

X -\- 

Pext = Pz r;; o — i X = vA — In 2 , (53) 

^ '^(ln2)2+X2 + |X ' ' ^ ' 

however the difference between this and eq. (jlip is irrelevant for all practical purposes. 
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A. 2 Variant of toy model 

We can also consider a model with 



^^ = %e-W.. (54) 
Pi dpt //2 V ) 



and correspondingly 

Pndpt fi {2n - 1)1 \ fi J ■ ^ ^ 



This model has the property that a = \J?>v jl^i. For v near In 2 this leads to 

In 2 /-^(P-') 4pj 



^(l-X) + i^^ dp* ^e-^^* + O (X^) = i , (56) 



2 

with X defined as in eq. (f53t) . resulting in 



A (pext) = + g ((^-4 - In2)3/V) , (57) 



or, equivalently, 



At large v one finds, again numerically, 

(p«> = .M-£i+0(^)=p-5^ + 0(^). (59) 

The two above equations both imply that (/Oext) turns on and approaches its asymptotic value 
somewhat faster than in the model with oc e~^*/ ^ j p. k. reasonable analytic approximation 
for (pcxt) over the whole domain is 



which has the correct large-i^ behaviour, and very nearly the correct coefficient for the 
V vA — In 2 turn-on. 



B Fluctuations in area/median extraction of p 
B.l Pure soft case 

To determine the event-to-event fluctuations Sd in the area/median extraction of p when the 
intrinsic event-to-event fluctuations are zero, it is convenient to work in the limit i^^jet ^ 1 
such that the probability distribution of pjct is close to a Gaussian!^ 



'^^We will ignore the issue of fluctuations in Ajet itself, i.e. it is really the tiled case that we consider here 
rather than the full jet case. Nevertheless the good agreement that we find with numerical studies in section 
suggests that it is not illegitimate to ignore jet area fluctuations for our purposes here. 
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The corresponding cumulative probability distribution for the jets is given by 



If the number N of jets is odd, N = 2m + 1, then the probability distribution of the median 
6p is obtained from the product of the probability of having one jet with pjet = p — Sp, rn jets 
with p smaller than this, and m jets with p larger than this: 

^{Sp) = (2m + ^{Sp) [PMSPT [1 - P^edSpr ■ (63) 

Making use of the expansion of the error function 



3^ 



I. (04) 

working in the large m and small 6p/a limit, and making use also of Stirling's formula 
m! ~ \/27rm(m/e)'", one can approximate eq. (|63p as 



dp vrcr \ n J vrcr \ tt J 

where in the last step we have replaced m ~ N/2. Using the relation N (Ajet) = ^tot; we 
finally obtain the following result for the standard deviation of the extracted median p values. 



This is about 25% larger than the standard deviation that would be obtained for p extracted 
as an average of the pjet values over the same total area. This moderate enhancement of 
fluctuations in the pure soft case is part of the price that one pays in exchange for the 
median's greater resilience to hard contamination. 

B.2 Hard contamination 

The result eq. (j34p for the average discrepancy in pext due to hard contamination can be 
obtained in an alternative manner, which will be more useful for estimating fluctuations. In 
this approach we imagine some distribution of soft jets, and then add in the hard partons. 
Some of the hard partons will enter jets whose pjet is already above the median value for p. 
These hard partons will not affected /Oext- The remaining hard partons (a number k) will enter 
jets that were below the median. These jets will acquire much larger transverse momenta, 
taking them well above the median. Thus it becomes necessary to recalculate the median, 
which will be shifted by k soft jets' worth. From eq. (j6ip . and working in the large- limit 
(throughout this section), this translates to an average shift in pext of 




{6p) = k5u 5i^-^lj^, (67) 
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where 5i is the 1-jet shift. Substituting (k) = (rih) /2 gives 



in accord with the first order term in (uh) /N in eq. 

We win consider two main sources of fluctuations in this result. Let us first imagine that 
k = 1. The median wiU shift up by one jet, and the distribution of 5p wih be simply be 
given by the distribution of the difference in pj^t between two neighbouring jets in the sorted 
sequence of jets (at position in the sequence that is near the median). That distribution is 
an exponential distribution with mean 6i, 

(iP 1 

_(fc = l) = -exp[-5p/5i] . (69) 
The distribution of the shift for k jets is 



d5p^ ' k\{6i] 



— ik) = iJ7T^^w[-Sp/Si] , (70) 



with the standard deviation ^/k6i. 

The second source of fluctuations comes from the fact that k is not constant but rather 
has a Poisson distribution with mean (uh) /2 and standard deviation \/(nhJ720 Combining 
this with the fluctuation on p for fixed k, leads to an overall standard deviation of 



^(hard) n — r. / 2vr / ^/juh) ( {nhf^^ \ \ 

^.,.ed = V{nH)Si = ^ I + O I 1 1 , (71) 



In terms of R and Atnt, this becomes 



^tot \ ^tot 



« = I ^ + O I ^^^^^^ 1 1 (T2a) 



where L ~ 1 was defined in eq. ([35 
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